Exploring and measuring non-linear correlations: Copulas, Lightspeed Transportation and Clustering
نویسندگان
چکیده
We propose a methodology to explore and measure the pairwise correlations that exist between variables in a dataset. The methodology leverages copulas for encoding dependence between two variables, state-of-the-art optimal transport for providing a relevant geometry to the copulas, and clustering for summarizing the main dependence patterns found between the variables. Some of the clusters centers can be used to parameterize a novel dependence coefficient which can target or forget specific dependence patterns. Finally, we illustrate the methodology with financial time series (credit default swaps, stocks, foreign exchange rates). Code and numerical experiments are available online at https://www.datagrapple.com/Tech for reproducible research.
منابع مشابه
Efficient estimation of a semiparametric dynamic copula model
Outline Introduction Semi-parametric dynamic copula Motivation Local likelihood estimation Variance of the estimator Bias of the estimator Bandwidth selection Estimation of joint likelihood Modeling of marginal distributions Simulations and applications Simulations Empirical example Conclusions Problems and Solutions Problems Modeling dependence is critical for financial time series Model volat...
متن کاملModeling linearly and non-linearly dependent simulation input data
Input modeling software tries to fit standard probability distributions to data assuming that the data are independent. However, the input environment can generate correlated data. Ignoring the correlations might lead to serious inaccuracies in the performance measures. In the past few years, several dependence modeling packages with different properties have been developed. In our dissertation...
متن کاملExtraction and 3D Segmentation of Tumors-Based Unsupervised Clustering Techniques in Medical Images
Introduction The diagnosis and separation of cancerous tumors in medical images require accuracy, experience, and time, and it has always posed itself as a major challenge to the radiologists and physicians. Materials and Methods We Received 290 medical images composed of 120 mammographic images, LJPEG format, scanned in gray-scale with 50 microns size, 110 MRI images including of T1-Wighted, T...
متن کاملFUZZY GOAL PROGRAMMING TECHNIQUE TO SOLVE MULTIOBJECTIVE TRANSPORTATION PROBLEMS WITH SOME NON-LINEAR MEMBERSHIP FUNCTIONS
The linear multiobjective transportation problem is a special type of vector minimum problem in which constraints are all equality type and the objectives are conicting in nature. This paper presents an application of fuzzy goal programming to the linear multiobjective transportation problem. In this paper, we use a special type of nonlinear (hyperbolic and exponential) membership functions to ...
متن کاملMeasuring the efficiency of a three-stage network using data envelopment analysis approach considering dual boundary
This paper presents a method for performance evaluation, ranking and clustering based on the double-frontier view to analyze the complex networks. The model allows us to open the structure of the “black box” and can help to obtain important information about efficient and inefficient points of the system. In this paper, we consider a three-stage network, in respect to the additional desirable a...
متن کامل